Comparing N-gram Models for Tag Suggestion in Tagging System

Authors

  • Hyunwoo Kim
  • Kangpyo Lee
  • Hyopil Shin
  • Hyoung-Joo Kim
Abstract

On the web, a tag is a significant keyword attached to content such as a photo, a video, or a blog article. Tagging is the action of adding a tag to content. Many web sites, such as del.icio.us and CiteULike, provide tagging systems. These tags can be used for web search. Users already recognize the value and importance of tags, but some users do not use them. These users might be annoyed at being forced to add tags, or they might simply not know which tags to add in order to obtain good search results. These problems are why a tag suggestion system would be beneficial. In this paper, we use n-gram models for tag suggestion in a tagging system. We gathered tag data from various web sites. Based on the crawled tag data, we will employ various n-gram models and compare the results in a subsequent paper. This is a progress paper.

Keywords: Web 2.0; Folksonomy; Tag Suggestion; Tagging; N-gram; Natural Language Processing
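The abstract does not say how the n-gram models are applied to tags, so the following is only a minimal sketch of one plausible reading: treat the tags on each item as an ordered sequence, count which tag follows which (a bigram model), and suggest the most frequent followers of the last tag a user entered. The function names `train_bigram_counts` and `suggest_tags` and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def train_bigram_counts(tag_sequences):
    """Count how often each tag directly follows another tag.

    Assumption: each item's tags form an ordered sequence, so adjacent
    tags can be treated as bigrams. The paper may use a different setup.
    """
    counts = defaultdict(Counter)
    for tags in tag_sequences:
        for prev_tag, next_tag in zip(tags, tags[1:]):
            counts[prev_tag][next_tag] += 1
    return counts


def suggest_tags(counts, last_tag, k=3):
    """Return up to k tags that most often followed last_tag."""
    followers = counts.get(last_tag, Counter())
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(followers.items(), key=lambda kv: (-kv[1], kv[0]))
    return [tag for tag, _ in ranked[:k]]


# Toy tag sequences invented for illustration; not the crawled data.
toy_posts = [
    ["web", "design", "css"],
    ["web", "design", "html"],
    ["web", "search", "tags"],
    ["photo", "travel", "korea"],
]

counts = train_bigram_counts(toy_posts)
suggestions = suggest_tags(counts, "web")
print(suggestions)
```

A higher-order model (trigram and beyond) would condition on the last two or more tags instead of one, at the cost of sparser counts; comparing such variants is what the abstract defers to later work.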

Similar resources

Beyond N in N-gram Tagging

The Hidden Markov Model (HMM) for part-of-speech (POS) tagging is typically based on tag trigrams. As such it models local context but not global context, leaving long-distance syntactic relations unrepresented. Using n-gram models for n > 3 in order to incorporate global context is problematic as the tag sequences corresponding to higher order models will become increasingly rare in training d...

Tagging syllable boundaries with joint n-gram models

This paper presents a statistical method for the segmentation of words into syllables which is based on a joint n-gram model. Our system assigns syllable boundaries to phonetically transcribed words. The syllabification task was formulated as a tagging task. The syllable tagger was trained on syllable-annotated phone sequences. In an evaluation using ten-fold cross-validation, the system correct...

Content-based and Graph-based Tag Suggestion

Social tagging is a popular and convenient way to organize information. Automatic tag suggestion can ease the user’s tagging activity. In this paper, we examine both content-based and graph-based methods for tag suggestion using the BibSonomy dataset, and describe our methods for ECML/PKDD Discovery Challenge 2009 submissions. In content-based tag suggestion, we propose a fast yet accurate method...

Tripartite Hidden Topic Models for Personalised Tag Suggestion

Social tagging systems provide methods for users to categorise resources using their own choice of keywords (or “tags”) without being bound to a restrictive set of predefined terms. Such systems typically provide simple tag recommendations to increase the number of tags assigned to resources. In this paper we extend the latent Dirichlet allocation topic model to include user data and use the es...

TagAssist: Automatic Tag Suggestion for Blog Posts

In this paper, we describe a system called TagAssist that provides tag suggestions for new blog posts by utilizing existing tagged posts. The system is able to increase the quality of suggested tags by performing lossless compression over existing tag data. In addition, the system employs a set of metrics to evaluate the quality of a potential tag suggestion. Coupled with the ability for users ...

Publication date: 2009